Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 39118 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 6.6 MiB |
| Average record size in memory | 176.0 B |
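The memory figures in this report can be cross-checked with simple arithmetic. The sketch below assumes that every column stores 8 bytes per row; this is an assumption that happens to reproduce the reported numbers (including the 305.6 KiB "Memory size" listed for each variable), not something the report states.

```python
# Assumption (not stated in the report): each column uses 8 bytes per row.
BYTES_PER_VALUE = 8

n_rows = 39118      # "Number of observations"
n_columns = 22      # "Number of variables"

column_bytes = n_rows * BYTES_PER_VALUE      # one column
record_bytes = n_columns * BYTES_PER_VALUE   # one row across all columns
total_bytes = n_rows * record_bytes          # whole table

column_kib = column_bytes / 1024
total_mib = total_bytes / 1024 ** 2

print(f"per column: {column_kib:.1f} KiB")
print(f"per record: {record_bytes} B")
print(f"total:      {total_mib:.1f} MiB")
```

Under that assumption the three values round to 305.6 KiB, 176 B and 6.6 MiB, which agrees with the tables.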
Variable types
| NUM | 11 |
|---|---|
| CAT | 10 |
| BOOL | 1 |
Warnings
| Warning | Type |
|---|---|
| euribor3m is highly correlated with emp_var_rate and 1 other field | High correlation |
| emp_var_rate is highly correlated with euribor3m and 1 other field | High correlation |
| nr_employed is highly correlated with emp_var_rate and 1 other field | High correlation |
| df_index has unique values | Unique |
| previous has 35058 (89.6%) zeros | Zeros |
Reproduction
| Analysis started | 2021-03-16 21:37:09.352716 |
|---|---|
| Analysis finished | 2021-03-16 21:38:05.672796 |
| Duration | 56.32 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
df_index
Real number (ℝ≥0)
| Distinct | 39118 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20606.27517 |
|---|---|
| Minimum | 0 |
| Maximum | 41187 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 305.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2061.85 |
| Q1 | 10296.25 |
| median | 20621.5 |
| Q3 | 30912.75 |
| 95-th percentile | 39137.15 |
| Maximum | 41187 |
| Range | 41187 |
| Interquartile range (IQR) | 20616.5 |
Descriptive statistics
| Standard deviation | 11894.35685 |
|---|---|
| Coefficient of variation (CV) | 0.5772201311 |
| Kurtosis | -1.200994653 |
| Mean | 20606.27517 |
| Median Absolute Deviation (MAD) | 10309.5 |
| Skewness | -0.001212727214 |
| Sum | 806076272 |
| Variance | 141475725 |
| Monotonicity | Strictly increasing |
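Several of the derived statistics follow directly from others in the same tables. As a minimal sketch, the snippet below recomputes the mean, coefficient of variation, variance, interquartile range and range for df_index from the values reported above; the variable names are illustrative only.

```python
# Values copied from the df_index tables above.
n_observations = 39118
total = 806076272            # "Sum"
std_dev = 11894.35685        # "Standard deviation"
q1, q3 = 10296.25, 30912.75  # "Q1", "Q3"
minimum, maximum = 0, 41187

mean = total / n_observations     # mean = sum / n
cv = std_dev / mean               # coefficient of variation = std / mean
variance = std_dev ** 2           # variance = std squared
iqr = q3 - q1                     # interquartile range = Q3 - Q1
value_range = maximum - minimum   # range = max - min

print(f"mean={mean:.5f} cv={cv:.6f} variance={variance:.0f} "
      f"iqr={iqr} range={value_range}")
```

The results match the reported figures up to the rounding of the inputs.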
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 27975 | 1 | < 0.1% | |
| 9550 | 1 | < 0.1% | |
| 15693 | 1 | < 0.1% | |
| 13644 | 1 | < 0.1% | |
| 3403 | 1 | < 0.1% | |
| 1354 | 1 | < 0.1% | |
| 7497 | 1 | < 0.1% | |
| 5448 | 1 | < 0.1% | |
| 25926 | 1 | < 0.1% | |
| Other values (39108) | 39108 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 41187 | 1 | < 0.1% | |
| 41186 | 1 | < 0.1% | |
| 41185 | 1 | < 0.1% | |
| 41184 | 1 | < 0.1% | |
| 41183 | 1 | < 0.1% |
age
Real number (ℝ≥0)
| Distinct | 74 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.72825809 |
|---|---|
| Minimum | 17 |
| Maximum | 95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 305.6 KiB |
Quantile statistics
| Minimum | 17 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 32 |
| median | 38 |
| Q3 | 47 |
| 95-th percentile | 57 |
| Maximum | 95 |
| Range | 78 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 9.791754256 |
|---|---|
| Coefficient of variation (CV) | 0.2464682502 |
| Kurtosis | 0.1940232706 |
| Mean | 39.72825809 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.6150391948 |
| Sum | 1554090 |
| Variance | 95.8784514 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 31 | 1884 | 4.8% | |
| 32 | 1787 | 4.6% | |
| 33 | 1783 | 4.6% | |
| 36 | 1731 | 4.4% | |
| 35 | 1705 | 4.4% | |
| 34 | 1679 | 4.3% | |
| 30 | 1651 | 4.2% | |
| 37 | 1411 | 3.6% | |
| 39 | 1403 | 3.6% | |
| 29 | 1385 | 3.5% | |
| Other values (64) | 22699 | 58.0% |
| Value | Count | Frequency (%) | |
| 17 | 1 | < 0.1% | |
| 18 | 12 | < 0.1% | |
| 19 | 28 | 0.1% | |
| 20 | 49 | 0.1% | |
| 21 | 83 | 0.2% |
| Value | Count | Frequency (%) | |
| 95 | 1 | < 0.1% | |
| 89 | 2 | < 0.1% | |
| 88 | 16 | < 0.1% | |
| 87 | 1 | < 0.1% | |
| 86 | 3 | < 0.1% |
job
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 305.6 KiB |
| Value | Count | Frequency (%) | |
| admin. | 9844 | 25.2% | |
| blue-collar | 9126 | 23.3% | |
| technician | 6463 | 16.5% | |
| services | 3886 | 9.9% | |
| management | 2790 | 7.1% | |
| entrepreneur | 1426 | 3.6% | |
| self-employed | 1378 | 3.5% | |
| retired | 1304 | 3.3% | |
| housemaid | 990 | 2.5% | |
| unemployed | 933 | 2.4% | |
| Other values (2) | 978 | 2.5% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 9.006288665 |
| Min length | 6 |
marital
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 305.6 KiB |
| Value | Count | Frequency (%) | |
| married | 23795 | 60.8% | |
| single | 10864 | 27.8% | |
| divorced | 4386 | 11.2% | |
| unknown | 73 | 0.2% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.834398487 |
| Min length | 6 |
education
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 305.6 KiB |
| Value | Count | Frequency (%) | |
| university.degree | 11425 | 29.2% | |
| high.school | 9081 | 23.2% | |
| basic.9y | 5908 | 15.1% | |
| professional.course | 4974 | 12.7% | |
| basic.4y | 3900 | 10.0% | |
| basic.6y | 2244 | 5.7% | |
| unknown | 1569 | 4.0% | |
| illiterate | 17 | < 0.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 19 |
|---|---|
| Median length | 11 |
| Mean length | 12.68446751 |
| Min length | 7 |
default
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 305.6 KiB |
| Value | Count | Frequency (%) | |
| no | 30585 | 78.2% | |
| unknown | 8530 | 21.8% | |
| yes | 3 | < 0.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 3.090367606 |
| Min length | 2 |
housing
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 305.6 KiB |
| Value | Count | Frequency (%) | |
| yes | 20450 | 52.3% | |
| no | 17732 | 45.3% | |
| unknown | 936 | 2.4% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 2.642415256 |
| Min length | 2 |
loan
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 305.6 KiB |
| Value | Count | Frequency (%) | |
| no | 32236 | 82.4% | |
| yes | 5946 | 15.2% | |
| unknown | 936 | 2.4% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.271639654 |
| Min length | 2 |
contact
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 305.6 KiB |
| Value | Count | Frequency (%) | |
| cellular | 24285 | 62.1% | |
| telephone | 14833 | 37.9% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.379186052 |
| Min length | 8 |
month
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 305.6 KiB |
| Value | Count | Frequency (%) | |
| may | 13652 | 34.9% | |
| jul | 7054 | 18.0% | |
| aug | 5911 | 15.1% | |
| jun | 5206 | 13.3% | |
| nov | 3732 | 9.5% | |
| apr | 2543 | 6.5% | |
| mar | 440 | 1.1% | |
| sep | 270 | 0.7% | |
| oct | 202 | 0.5% | |
| dec | 108 | 0.3% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
day_of_week
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 305.6 KiB |
| Value | Count | Frequency (%) | |
| thu | 8183 | 20.9% | |
| mon | 8094 | 20.7% | |
| wed | 7739 | 19.8% | |
| tue | 7657 | 19.6% | |
| fri | 7445 | 19.0% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
duration
Real number (ℝ≥0)
| Distinct | 1498 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 253.0746715 |
|---|---|
| Minimum | 0 |
| Maximum | 4918 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Memory size | 305.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 100 |
| median | 177 |
| Q3 | 314 |
| 95-th percentile | 736 |
| Maximum | 4918 |
| Range | 4918 |
| Interquartile range (IQR) | 214 |
Descriptive statistics
| Standard deviation | 251.9674239 |
|---|---|
| Coefficient of variation (CV) | 0.9956248186 |
| Kurtosis | 18.77168096 |
| Mean | 253.0746715 |
| Median Absolute Deviation (MAD) | 92 |
| Skewness | 3.151387047 |
| Sum | 9899775 |
| Variance | 63487.58271 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 85 | 168 | 0.4% | |
| 90 | 164 | 0.4% | |
| 136 | 161 | 0.4% | |
| 73 | 159 | 0.4% | |
| 124 | 159 | 0.4% | |
| 111 | 157 | 0.4% | |
| 87 | 157 | 0.4% | |
| 72 | 155 | 0.4% | |
| 109 | 153 | 0.4% | |
| 106 | 153 | 0.4% | |
| Other values (1488) | 37532 | 95.9% |
| Value | Count | Frequency (%) | |
| 0 | 4 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 3 | < 0.1% | |
| 4 | 11 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4918 | 1 | < 0.1% | |
| 3643 | 1 | < 0.1% | |
| 3631 | 1 | < 0.1% | |
| 3422 | 1 | < 0.1% | |
| 3366 | 1 | < 0.1% |
campaign
Real number (ℝ≥0)
| Distinct | 42 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.595505905 |
|---|---|
| Minimum | 1 |
| Maximum | 56 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 305.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 56 |
| Range | 55 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.813804537 |
|---|---|
| Coefficient of variation (CV) | 1.08410639 |
| Kurtosis | 36.22037386 |
| Mean | 2.595505905 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.7284533 |
| Sum | 101531 |
| Variance | 7.917495972 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 16591 | 42.4% | |
| 2 | 10041 | 25.7% | |
| 3 | 5103 | 13.0% | |
| 4 | 2552 | 6.5% | |
| 5 | 1539 | 3.9% | |
| 6 | 939 | 2.4% | |
| 7 | 605 | 1.5% | |
| 8 | 388 | 1.0% | |
| 9 | 279 | 0.7% | |
| 10 | 221 | 0.6% | |
| Other values (32) | 860 | 2.2% |
| Value | Count | Frequency (%) | |
| 1 | 16591 | 42.4% | |
| 2 | 10041 | 25.7% | |
| 3 | 5103 | 13.0% | |
| 4 | 2552 | 6.5% | |
| 5 | 1539 | 3.9% |
| Value | Count | Frequency (%) | |
| 56 | 1 | < 0.1% | |
| 43 | 2 | < 0.1% | |
| 42 | 2 | < 0.1% | |
| 41 | 1 | < 0.1% | |
| 40 | 2 | < 0.1% |
pdays
Real number (ℝ≥0)
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 991.0291426 |
|---|---|
| Minimum | 0 |
| Maximum | 999 |
| Zeros | 7 |
| Zeros (%) | < 0.1% |
| Memory size | 305.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 999 |
| Q1 | 999 |
| median | 999 |
| Q3 | 999 |
| 95-th percentile | 999 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 88.61100315 |
|---|---|
| Coefficient of variation (CV) | 0.08941311546 |
| Kurtosis | 119.6114372 |
| Mean | 991.0291426 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -11.02738556 |
| Sum | 38767078 |
| Variance | 7851.90988 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 999 | 38804 | 99.2% | |
| 3 | 86 | 0.2% | |
| 6 | 43 | 0.1% | |
| 2 | 35 | 0.1% | |
| 12 | 30 | 0.1% | |
| 10 | 23 | 0.1% | |
| 11 | 16 | < 0.1% | |
| 4 | 15 | < 0.1% | |
| 5 | 14 | < 0.1% | |
| 9 | 13 | < 0.1% | |
| Other values (9) | 39 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 7 | < 0.1% | |
| 1 | 9 | < 0.1% | |
| 2 | 35 | 0.1% | |
| 3 | 86 | 0.2% | |
| 4 | 15 | < 0.1% |
| Value | Count | Frequency (%) | |
| 999 | 38804 | 99.2% | |
| 22 | 2 | < 0.1% | |
| 16 | 2 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| 14 | 4 | < 0.1% |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1114832047 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 35058 |
| Zeros (%) | 89.6% |
| Memory size | 305.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3419828429 |
|---|---|
| Coefficient of variation (CV) | 3.06757277 |
| Kurtosis | 15.42839576 |
| Mean | 0.1114832047 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.393408484 |
| Sum | 4361 |
| Variance | 0.1169522649 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 35058 | 89.6% | |
| 1 | 3791 | 9.7% | |
| 2 | 249 | 0.6% | |
| 3 | 12 | < 0.1% | |
| 4 | 5 | < 0.1% | |
| 5 | 2 | < 0.1% | |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 35058 | 89.6% | |
| 1 | 3791 | 9.7% | |
| 2 | 249 | 0.6% | |
| 3 | 12 | < 0.1% | |
| 4 | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 6 | 1 | < 0.1% | |
| 5 | 2 | < 0.1% | |
| 4 | 5 | < 0.1% | |
| 3 | 12 | < 0.1% | |
| 2 | 249 | 0.6% |
poutcome
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 305.6 KiB |
| Value | Count | Frequency (%) | |
| nonexistent | 35058 | 89.6% | |
| failure | 3753 | 9.6% | |
| success | 307 | 0.8% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.58484585 |
| Min length | 7 |
emp_var_rate
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2121044021 |
|---|---|
| Minimum | -3.4 |
| Maximum | 1.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 305.6 KiB |
Quantile statistics
| Minimum | -3.4 |
|---|---|
| 5-th percentile | -1.8 |
| Q1 | -1.8 |
| median | 1.1 |
| Q3 | 1.4 |
| 95-th percentile | 1.4 |
| Maximum | 1.4 |
| Range | 4.8 |
| Interquartile range (IQR) | 3.2 |
Descriptive statistics
| Standard deviation | 1.486356203 |
|---|---|
| Coefficient of variation (CV) | 7.007663155 |
| Kurtosis | -0.976048548 |
| Mean | 0.2121044021 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | -0.8142475073 |
| Sum | 8297.1 |
| Variance | 2.209254763 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1.4 | 16223 | 41.5% | |
| -1.8 | 8873 | 22.7% | |
| 1.1 | 7762 | 19.8% | |
| -0.1 | 3665 | 9.4% | |
| -2.9 | 1447 | 3.7% | |
| -1.7 | 501 | 1.3% | |
| -3.4 | 295 | 0.8% | |
| -1.1 | 244 | 0.6% | |
| -3 | 98 | 0.3% | |
| -0.2 | 10 | < 0.1% |
| Value | Count | Frequency (%) | |
| -3.4 | 295 | 0.8% | |
| -3 | 98 | 0.3% | |
| -2.9 | 1447 | 3.7% | |
| -1.8 | 8873 | 22.7% | |
| -1.7 | 501 | 1.3% |
| Value | Count | Frequency (%) | |
| 1.4 | 16223 | 41.5% | |
| 1.1 | 7762 | 19.8% | |
| -0.1 | 3665 | 9.4% | |
| -0.2 | 10 | < 0.1% | |
| -1.1 | 244 | 0.6% |
cons_price_idx
Real number (ℝ≥0)
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 93.59324845 |
|---|---|
| Minimum | 92.201 |
| Maximum | 94.767 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 305.6 KiB |
Quantile statistics
| Minimum | 92.201 |
|---|---|
| 5-th percentile | 92.893 |
| Q1 | 93.075 |
| median | 93.897 |
| Q3 | 93.994 |
| 95-th percentile | 94.465 |
| Maximum | 94.767 |
| Range | 2.566 |
| Interquartile range (IQR) | 0.919 |
Descriptive statistics
| Standard deviation | 0.553068145 |
|---|---|
| Coefficient of variation (CV) | 0.00590927395 |
| Kurtosis | -0.8647694777 |
| Mean | 93.59324845 |
| Median Absolute Deviation (MAD) | 0.453 |
| Skewness | -0.1996067867 |
| Sum | 3661180.693 |
| Variance | 0.305884373 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 93.994 | 7762 | 19.8% | |
| 93.918 | 6678 | 17.1% | |
| 92.893 | 5771 | 14.8% | |
| 93.444 | 5171 | 13.2% | |
| 94.465 | 4374 | 11.2% | |
| 93.2 | 3598 | 9.2% | |
| 93.075 | 2442 | 6.2% | |
| 92.963 | 691 | 1.8% | |
| 92.201 | 593 | 1.5% | |
| 92.843 | 275 | 0.7% | |
| Other values (16) | 1763 | 4.5% |
| Value | Count | Frequency (%) | |
| 92.201 | 593 | 1.5% | |
| 92.379 | 103 | 0.3% | |
| 92.431 | 74 | 0.2% | |
| 92.469 | 163 | 0.4% | |
| 92.649 | 118 | 0.3% |
| Value | Count | Frequency (%) | |
| 94.767 | 16 | < 0.1% | |
| 94.601 | 61 | 0.2% | |
| 94.465 | 4374 | 11.2% | |
| 94.215 | 213 | 0.5% | |
| 94.199 | 167 | 0.4% |
cons_conf_idx
Real number (ℝ)
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -40.77193875 |
|---|---|
| Minimum | -50.8 |
| Maximum | -26.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 305.6 KiB |
Quantile statistics
| Minimum | -50.8 |
|---|---|
| 5-th percentile | -47.1 |
| Q1 | -42.7 |
| median | -41.8 |
| Q3 | -36.4 |
| 95-th percentile | -36.1 |
| Maximum | -26.9 |
| Range | 23.9 |
| Interquartile range (IQR) | 6.3 |
Descriptive statistics
| Standard deviation | 4.269385747 |
|---|---|
| Coefficient of variation (CV) | -0.1047138272 |
| Kurtosis | -0.8181031648 |
| Mean | -40.77193875 |
| Median Absolute Deviation (MAD) | 4.4 |
| Skewness | 0.1442791336 |
| Sum | -1594916.7 |
| Variance | 18.22765466 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| -36.4 | 7762 | 19.8% | |
| -42.7 | 6678 | 17.1% | |
| -46.2 | 5771 | 14.8% | |
| -36.1 | 5171 | 13.2% | |
| -41.8 | 4374 | 11.2% | |
| -42 | 3598 | 9.2% | |
| -47.1 | 2442 | 6.2% | |
| -40.8 | 691 | 1.8% | |
| -31.4 | 593 | 1.5% | |
| -50 | 275 | 0.7% | |
| Other values (16) | 1763 | 4.5% |
| Value | Count | Frequency (%) | |
| -50.8 | 16 | < 0.1% | |
| -50 | 275 | 0.7% | |
| -49.5 | 61 | 0.2% | |
| -47.1 | 2442 | 6.2% | |
| -46.2 | 5771 | 14.8% |
| Value | Count | Frequency (%) | |
| -26.9 | 74 | 0.2% | |
| -29.8 | 103 | 0.3% | |
| -30.1 | 118 | 0.3% | |
| -31.4 | 593 | 1.5% | |
| -33 | 98 | 0.3% |
euribor3m
Real number (ℝ≥0)
| Distinct | 311 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.766859553 |
|---|---|
| Minimum | 0.634 |
| Maximum | 5.045 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 305.6 KiB |
Quantile statistics
| Minimum | 0.634 |
|---|---|
| 5-th percentile | 0.899 |
| Q1 | 1.405 |
| median | 4.857 |
| Q3 | 4.962 |
| 95-th percentile | 4.966 |
| Maximum | 5.045 |
| Range | 4.411 |
| Interquartile range (IQR) | 3.557 |
Descriptive statistics
| Standard deviation | 1.653674566 |
|---|---|
| Coefficient of variation (CV) | 0.4390061648 |
| Kurtosis | -1.147991651 |
| Mean | 3.766859553 |
| Median Absolute Deviation (MAD) | 0.107 |
| Skewness | -0.870283185 |
| Sum | 147352.012 |
| Variance | 2.73463957 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4.857 | 2868 | 7.3% | |
| 4.962 | 2610 | 6.7% | |
| 4.963 | 2485 | 6.4% | |
| 4.961 | 1902 | 4.9% | |
| 4.856 | 1210 | 3.1% | |
| 4.964 | 1175 | 3.0% | |
| 1.405 | 1166 | 3.0% | |
| 4.965 | 1069 | 2.7% | |
| 4.864 | 1044 | 2.7% | |
| 4.96 | 1013 | 2.6% | |
| Other values (301) | 22576 | 57.7% |
| Value | Count | Frequency (%) | |
| 0.634 | 6 | < 0.1% | |
| 0.635 | 29 | 0.1% | |
| 0.636 | 12 | < 0.1% | |
| 0.637 | 3 | < 0.1% | |
| 0.638 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5.045 | 9 | < 0.1% | |
| 5 | 7 | < 0.1% | |
| 4.97 | 172 | 0.4% | |
| 4.968 | 991 | 2.5% | |
| 4.967 | 643 | 1.6% |
nr_employed
Real number (ℝ≥0)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5175.150199 |
|---|---|
| Minimum | 4963.6 |
| Maximum | 5228.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 305.6 KiB |
Quantile statistics
| Minimum | 4963.6 |
|---|---|
| 5-th percentile | 5076.2 |
| Q1 | 5099.1 |
| median | 5195.8 |
| Q3 | 5228.1 |
| 95-th percentile | 5228.1 |
| Maximum | 5228.1 |
| Range | 264.5 |
| Interquartile range (IQR) | 129 |
Descriptive statistics
| Standard deviation | 64.01678691 |
|---|---|
| Coefficient of variation (CV) | 0.01237003458 |
| Kurtosis | 0.2550537963 |
| Mean | 5175.150199 |
| Median Absolute Deviation (MAD) | 32.3 |
| Skewness | -1.105565903 |
| Sum | 202441525.5 |
| Variance | 4098.149007 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 5228.1 | 16223 | 41.5% | |
| 5099.1 | 8488 | 21.7% | |
| 5191 | 7762 | 19.8% | |
| 5195.8 | 3665 | 9.4% | |
| 5076.2 | 1447 | 3.7% | |
| 4991.6 | 501 | 1.3% | |
| 5008.7 | 385 | 1.0% | |
| 5017.5 | 295 | 0.8% | |
| 4963.6 | 244 | 0.6% | |
| 5023.5 | 98 | 0.3% |
| Value | Count | Frequency (%) | |
| 4963.6 | 244 | 0.6% | |
| 4991.6 | 501 | 1.3% | |
| 5008.7 | 385 | 1.0% | |
| 5017.5 | 295 | 0.8% | |
| 5023.5 | 98 | 0.3% |
| Value | Count | Frequency (%) | |
| 5228.1 | 16223 | 41.5% | |
| 5195.8 | 3665 | 9.4% | |
| 5191 | 7762 | 19.8% | |
| 5176.3 | 10 | < 0.1% | |
| 5099.1 | 8488 | 21.7% |
y
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 305.6 KiB |
| Value | Count | Frequency (%) | |
| 0 | 35627 | 91.1% | |
| 1 | 3491 | 8.9% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear relationship the steepness of the line does not affect r; only its direction does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
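The calculation described above can be sketched in plain Python. This is an illustrative implementation on toy data, not the code the report itself runs; the function name `pearson_r` and the sample values are assumptions made for the example.

```python
import math

def pearson_r(xs, ys):
    """Covariance of X and Y divided by the product of their standard deviations."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least two points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / n)
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / n)
    if std_x == 0 or std_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / (std_x * std_y)

# Toy data (not taken from the report): exact linear relationships.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y_up = [2 * v + 10 for v in x]      # increasing line
y_down = [-0.5 * v + 3 for v in x]  # decreasing line

r_up = pearson_r(x, y_up)
r_down = pearson_r(x, y_down)
# Shifting and rescaling x leaves r unchanged.
r_rescaled = pearson_r([3 * v + 7 for v in x], y_up)
print(r_up, r_down, r_rescaled)
```

Both lines give |r| = 1 regardless of slope, and rescaling one variable does not change the result, which illustrates the invariance mentioned in the text.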
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
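As a minimal sketch of that definition, the snippet below ranks each variable (tied values share the average of their positions, which is a common convention and an assumption here) and then applies the Pearson formula to the ranks. The helper names and the toy data are illustrative.

```python
import math

def average_ranks(values):
    """Rank values from 1..n; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    sx = math.sqrt(sum((a - mx) ** 2 for a in xs))
    sy = math.sqrt(sum((b - my) ** 2 for b in ys))
    return cov / (sx * sy)

def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# Toy data (not from the report): monotonic but non-linear.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

On this data ρ is 1 because the relationship is perfectly monotonic, while Pearson's r falls below 1 because it is not linear.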
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
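The pair-counting definition above corresponds to the variant usually called tau-a. The sketch below implements that variant directly on toy data; it is an illustration only, and libraries such as SciPy default to tau-b, which additionally adjusts for ties, so results can differ when the data contain tied values.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least two points")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1    # both variables move in the same direction
        elif product < 0:
            discordant += 1    # the variables move in opposite directions
        # product == 0 is a tie and counts as neither
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Toy data (not from the report): 7 concordant and 3 discordant pairs.
x = [1, 2, 3, 4, 5]
y = [3, 1, 4, 2, 5]
tau = kendall_tau_a(x, y)
print(tau)  # (7 - 3) / 10 = 0.4
```

The quadratic loop over all pairs keeps the example close to the definition; it is not meant for tables as large as the one profiled here.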
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently across categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient for a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. The report uses the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | df_index | age | job | marital | education | default | housing | loan | contact | month | day_of_week | duration | campaign | pdays | previous | poutcome | emp_var_rate | cons_price_idx | cons_conf_idx | euribor3m | nr_employed | y |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 44 | blue-collar | married | basic.4y | unknown | yes | no | cellular | aug | thu | 210 | 1 | 999 | 0 | nonexistent | 1.4 | 93.444 | -36.1 | 4.963 | 5228.1 | 0 |
| 1 | 1 | 53 | technician | married | unknown | no | no | no | cellular | nov | fri | 138 | 1 | 999 | 0 | nonexistent | -0.1 | 93.200 | -42.0 | 4.021 | 5195.8 | 0 |
| 2 | 3 | 39 | services | married | high.school | no | no | no | cellular | apr | fri | 185 | 2 | 999 | 0 | nonexistent | -1.8 | 93.075 | -47.1 | 1.405 | 5099.1 | 0 |
| 3 | 5 | 30 | management | divorced | basic.4y | no | yes | no | cellular | jul | tue | 68 | 8 | 999 | 0 | nonexistent | 1.4 | 93.918 | -42.7 | 4.961 | 5228.1 | 0 |
| 4 | 6 | 37 | blue-collar | married | basic.4y | no | yes | no | cellular | may | thu | 204 | 1 | 999 | 0 | nonexistent | -1.8 | 92.893 | -46.2 | 1.327 | 5099.1 | 0 |
| 5 | 7 | 39 | blue-collar | divorced | basic.9y | no | yes | no | cellular | may | fri | 191 | 1 | 999 | 0 | nonexistent | -1.8 | 92.893 | -46.2 | 1.313 | 5099.1 | 0 |
| 6 | 8 | 36 | admin. | married | university.degree | no | no | no | cellular | jun | mon | 174 | 1 | 3 | 1 | success | -2.9 | 92.963 | -40.8 | 1.266 | 5076.2 | 1 |
| 7 | 9 | 27 | blue-collar | single | basic.4y | no | yes | no | cellular | apr | thu | 191 | 2 | 999 | 1 | failure | -1.8 | 93.075 | -47.1 | 1.410 | 5099.1 | 0 |
| 8 | 10 | 34 | housemaid | single | university.degree | no | no | no | telephone | may | fri | 62 | 2 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.864 | 5191.0 | 0 |
| 9 | 11 | 41 | management | married | university.degree | no | yes | no | cellular | aug | thu | 789 | 1 | 999 | 0 | nonexistent | 1.4 | 93.444 | -36.1 | 4.964 | 5228.1 | 0 |
Last rows
| | df_index | age | job | marital | education | default | housing | loan | contact | month | day_of_week | duration | campaign | pdays | previous | poutcome | emp_var_rate | cons_price_idx | cons_conf_idx | euribor3m | nr_employed | y |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 39108 | 41177 | 52 | self-employed | single | university.degree | unknown | yes | no | telephone | jun | fri | 73 | 1 | 999 | 0 | nonexistent | 1.4 | 94.465 | -41.8 | 4.967 | 5228.1 | 0 |
| 39109 | 41178 | 35 | technician | married | high.school | no | no | no | telephone | aug | fri | 243 | 1 | 999 | 0 | nonexistent | 1.4 | 93.444 | -36.1 | 4.966 | 5228.1 | 1 |
| 39110 | 41179 | 29 | technician | single | basic.9y | no | yes | no | cellular | may | mon | 214 | 1 | 999 | 0 | nonexistent | -1.8 | 92.893 | -46.2 | 1.299 | 5099.1 | 0 |
| 39111 | 41180 | 44 | services | married | high.school | unknown | yes | yes | cellular | aug | fri | 34 | 1 | 999 | 0 | nonexistent | 1.4 | 93.444 | -36.1 | 4.966 | 5228.1 | 0 |
| 39112 | 41182 | 24 | admin. | married | high.school | no | yes | no | cellular | may | thu | 118 | 4 | 999 | 1 | failure | -1.8 | 92.893 | -46.2 | 1.266 | 5099.1 | 0 |
| 39113 | 41183 | 59 | retired | married | high.school | unknown | no | yes | telephone | jun | thu | 222 | 1 | 999 | 0 | nonexistent | 1.4 | 94.465 | -41.8 | 4.866 | 5228.1 | 0 |
| 39114 | 41184 | 31 | housemaid | married | basic.4y | unknown | no | no | telephone | may | thu | 196 | 2 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.860 | 5191.0 | 0 |
| 39115 | 41185 | 42 | admin. | single | university.degree | unknown | yes | yes | telephone | may | wed | 62 | 3 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | 0 |
| 39116 | 41186 | 48 | technician | married | professional.course | no | no | yes | telephone | oct | tue | 200 | 2 | 999 | 0 | nonexistent | -3.4 | 92.431 | -26.9 | 0.742 | 5017.5 | 0 |
| 39117 | 41187 | 25 | student | single | high.school | no | no | no | telephone | may | fri | 112 | 4 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.859 | 5191.0 | 0 |